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Abstract 

The Densest fc-subgraph problem (i.e. find a size k subgraph with maximum number of 
edges), is one of the notorious problems in approximation algorithms. There is a significant 
gap between known upper and lower bounds for Densest fc-subgraph: the current best algorithm 
gives an fa 0{n}^^) approximation, while even showing a small constant factor hardness requires 
significantly stronger assumptions than P ^ NP. In addition to interest in designing better 
algorithms, a number of recent results have exploited the conjectured hardness of Densest fc- 
subgraph and its variants. Thus, understanding the approximability of Densest fc-subgraph is 
an important challenge. 

In this work, wc give evidence for the hardness of approximating Densest fc-subgraph within 
polynomial factors. Specifically, we expose the limitations of strong semidefinite programs from 
SDP hierarchies in solving Densest fc-subgraph. Our results include: 

• A lower bound of log'^ n) on the integrality gap for f7(logn/loglogn) rounds of 
the Sherali- Adams relaxation for Densest fc-subgraph. This also holds for the relaxation 
obtained from Sherali- Adams with an added SDP constraint. Our gap instances are in 
fact Erdos-Renyi random graphs. 

• For every e > 0, a lower bound of n'^/^'^-'^ on the integrality gap of n^^*^^ rounds of the 
Lasserre SDP relaxation for Densest fc-subgraph, and an n^=(i) gap for n^~'^ rounds. Our 
construction proceeds via a reduction from random instances of a certain Max;-CSP over 
large domains. 

In the absence of inapproximability results for Densest fc-subgraph, our results show that beating 
a factor of n^*^^^ is a barrier for even the most powerful SDPs, and in fact even beating the best 
known 'n}/^ factor is a barrier for current techniques. 

Our results indicate that approximating Densest fc-subgraph within a polynomial factor 
might be a harder problem than Unique Games or Small Set Expansion, since these problems 
were recently shown to be solvable using n^"'^' rounds of the Lasserre hierarchy, where e is the 
completeness parameter in Unique Games and Small Set Expansion. 
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1 Introduction 



The densest /c-subgraph problem takes as input a graph G{V, E) on n vertices and a parameter 
k, and asks for a subgraph of G on at most k vertices having the maximum number of edges. 
While it is a fundamental graph optimization problem and arises in several applications (community 
detection in social networks, identifying protein families and molecular complexes in protein-protein 
interaction networks, etc), there is a huge gap between the best approximation algorithm and the 
known inapproximability results. The current best approximation algorithm due to [BCC^IO] gives 
0(n^/'^+'^)-factor approximation algorithm which runs in time ■nP^'^/^) for any constant e > 0. On 
the inapproximability side, [Fci02] initially showed a small constant factor inapproximability for 
Densest A;-subgraph using the random 3-SAT assumption. [Kho04] used quasi-random PCPs to rule 
out a PTAS. More recently, [RSIO, AAM^IO] used more non-standard assumptions to rule out any 
constant factor approximation algorithms. 

While only constant factor approximations have been ruled out, it is commonly believed that 
Densest fc-subgraph is much harder to approximate even on average (for a natural distribution on 
hard instances). Recently, average-case hardness assumptions based on the hardness of "planted" 
versions of Densest A;-subgraph were used for public key cryptography [ABW08] and in showing that 
financial derivates can be fraudulently priced without detection [ABBGIO]. Given the interest in 
Densest /c-subgraph from both the algorithms and the complexity point of view, developing a better 
understanding of the problem is an important challenge for the field. 

In this work, we study lift-and-project relaxations for Densest /c-subgraph. Lift-and-project 
methods are systematic iterative procedures to obtain sequences of increasingly stronger mathe- 
matical programming relaxations for an integer optimization problem (e.g. Lovasz-Schrijver [LS91], 
Sherali- Adams [SA90] and Lasserre [LasOl]. See the survey by Laurent [Lau03] for a comparison). 
Typically, the relaxation obtained after r levels of these strengthenings can be solved in -nP^"^^ time. 
A number of recent papers have studied the strength and limitations of such relaxations as a basis for 
designing approximation algorithms for various problems [ABLT06, CMM09, Chl06, CS06, dVM07, 
GMPT07, GMT09, KS09, RS09, Sch08, STT07a, STT07b, Tul09] (see the recent survey by Chlam- 
tac and Tulsiani [CTll]). In most cases of approximation algorithms that use strengthened LP and 
SDP relaxations, such relaxations can be obtained from a few levels of such lift-and-project proce- 
dures. In fact, the 0{'n}/^~^^) approximation algorithm of [BCC^IO] for Densest A;-subgraph uses 
a linear programming relaxation which is weaker than that obtained from 0(l/e) levels of the 
Sherali-Adams hierarchy.^ [BCC+10] also show that the integrality gap becomes 0(n^/^~^) after 
rpi'^) levels of the Sherali Adams LP hierarchy. 

Our results. In this paper, we first study lift-and-project relaxations for Densest A;-subgraph ob- 
tained from the Sherali-Adams hierarchy. We show that even 0( ^J°^^^^ ) levels of the Sherali-Adams 

relaxation have an integrality gap of ^l{^n}/^). Then, we turn to the Lasserre hierarchy for Densest 
fc-subgraph. We show an integrality gap of polynomial ratio {rf, for small enough constant e) for 
almost linear (n^"*-^^'^)) levels of the Lasserre relaxation. If we only aim at an integrality gap for 
polynomial (n*^) levels of the Lasserre relaxation, the ratio of the gap can be as large as Tn?/^'^~^^''\ 
Our gap instances are actually (Erdos - Renyi) random graph instances Q{n,p) and random 
bipartite graphs under a special distribution - hence, we show that natural distributions of instances 
are integrality gap instances with high probability. 

'^[BCC^IO] also gives a purely combinatorial algorithm that does not use a linear program. 
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We note that prior results exhibiting gap instances for lift-and-project relaxations do so for 
problems that are already known to be hard to approximate under some suitable assumption; based 
on this hardness result, one would expect lift-and-project relaxations to have an integrality gap that 
matches the inapproximability factor. 

Our gap constructions for Densest /c-subgraph in this paper are a rare exception to this trend, 
as the integrality gaps we show are substantially stronger than the (very weak) hardness bounds 
known for the problem. In fact, we are only aware of the following examples where a polynomial- 
round Lasserre integrality gap stronger than the corresponding NP-hardness result is known : Max 
X-CSP, /C-coloring [Tul09], Balanced Separator and Uniform Sparsest Cut [GSZll]. In the first 
two cases, NP-hardness results that are not that far from the gaps are known [STOO, Kho02] and for 
Max ivT-CSP a matching Unique-Games hardness is also known [ST09]. For the other two problems, 
constant factor integrality gaps were shown for linear number of rounds of Lasserre hierarchy 
[GSZll]. Again, while these problems are not known to be APX-hard, under the conjectured 
intractability of Small Set Expansion, they are known to be hard to approximate within any constant 
factor. 

In the absence of inapproximability results for Densest A;-subgraph, our results show that beating 
a factor of n^^^^ is a barrier for even the most powerful SDPs, and in fact even beating the best 
known n^/^ factor is a barrier for current techniques. These results are perhaps indicative of the 
hardness of approximating Densest A;-subgraph within n^^^^ factors. 

Relation to the Small Set Expansion and Unique Games. A problem related to Densest k- 
subgraph is the Small Set Expansion (SSE) problem, which has received a lot of recent attention due 
to strong connections to the Unique Games conjecture [RSIO]. One way to state the SSE conjecture 
[RSIO] (which is known to imply the Unique Games conjecture) is as follows: for all e > 0, there 
exists 6,D (think of D as a constant), such that the following problem is not polynomial-time 
solvable: 

Definition 1.1 (The Gap-SSE problem). Given a D-regular instance G(y,E) with k = 6n, the 
Gap-SSE problem is to distinguish between the following two cases. 

• Yes case. There exists a subgraph of k vertices with average degree at least (1 — £)D. 

• No case. All subgraphs of k vertices have average degree at most eD. 

Clearly, Densest fc-subgraph is hard to approximate within any constant factor, assuming the 
Small Set Expansion conjecture. On the other hand, our results indicate that approximating Densest 
A:-subgraph even within a polynomial factor may be a harder problem than Unique Games or Small 
Set Expansion, because these problems were recently shown to be solvable using n*^"'^' rounds of 
the Lasserre hierarchy, where e is the completeness parameter in Unique Games and Small Set 
Expansion [BRSll, GSll]. 

2 Preliminaries 
2.1 Notation. 

We introduce some notation which will be used throughout the paper. G = (V, E) refers to a graph 
which is an instance of the Densest /c-subgraph problem on n vertices, and k refers to the size of 
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the subgraph we are required to output. For an induced subgraph H C G, we denote by d{H) the 
average degree (or density of H). For a vertex v in subgraph H, we will denote by T}{{v) the set 
of neighbors of in if (the suffix will be dropped when H = G). 

The phrase "with high probability" will mean: with probability 1 — ^^^j for any polynomial 
p{n). 

2.2 The relaxation hierarchies for Densest fc-subgraph. 

We will be concerned with the SDP relaxations derived from the Sherali-Adams and Lasserre 
hierarchies for Densest A;-subgraph. As in other lift-and- project schemes, a feasible solution to r 
levels of these hierarchies satisfies the condition that for any set of r vertices, it defines a valid 
distribution over integral solutions for these vertices - in particular, the integrality gap becomes 
1 after n levels. Further, the relaxations given by r levels of the Sherali-Adams and Lasserre 
hierarchies can be solved in rfi^'^^ time. We are interested in the integrality gap of r levels of these 
relaxations for Densest fc-subgraph. Refer to [CTll] for a more comprehensive comparison of these 
relaxation hierarchies. 

2.2.1 The Sherali-Adams LP hierarchy. 

The Sherali-Adams hierarchy starts with a simple LP relaxation of a {0, 1} integer program, and 
obtains a sequence of successively tighter relaxations with more levels. The natural LP relaxation 
for Densest fc-subgraph (LPl in Figure 1) [SW98, FS97] has variables {xj} to denote if vertex i 
belongs to the solution, and edge variables {xij} {ij)^E{G) to denote if both i,j are in the subgraph. 
This LP has an integrality gap of ([FKP01,'fS97]). 



Natural LP (LPl): 


Min. degree LP (LP2): 


max Xij 

(i,3)&E{G) 
s.t. Xi <k 

Vz,j G y, < Xij <Xi<\ 


max d 
s.t. Xi < k, and 

^{xij ^ V} s.t. 

\/i £ V Xij > dxi 
jer(i) 

Vi, j G V Xij = Xji 

£ V < Xij < Xi < 1 



Figure 1: Two Linear Programming relaxations for Densest fc-subgraph 

For our integrality gaps, we will in fact start with a stronger basic (first-level) linear program 
(LP2 in Figure 1) which is equivalent upto a factor of 2 (see [BCC+10]). Intuitively, it tries to find 
a A;-subgraph H where the minimum degree dn is maximized. An LP hierarchy obtained from this 
min. degree LP (LP2) was in fact used by [BCC+10] to obtain their approximation algorithm. ^ 

^While the program as stated is not linear, we guess the degree d and consider the feasibility linear program that 
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Let us consider strengthening this LP by considering r levels of the Sherali- Adams hierarchy 
{SAr, shown in Figure 2). In the lifted LP, the variable xs is supposed to capture whether every 
vertex in S belongs to the chosen /c-subgraph (i.e., xs = Hies ^«)- Further if we take two sets S, S' 
of < r vertices, the local distributions induced by a feasible solution (using the inclusion-exclusion 
constraints), agree on the variables in the intersection S D S' . We follow the notation established 
in [CTll] while defining the hierarchy. 

2.2.2 The mixed hierarchy (SheraU- Adams + SDP). 

The mixed hierarchy (also refered to as SA+) imposes an additional SDP constraint on top of the 
Sherali-Adams LP relaxation. In particular, it asks for the values Xij to come from vector inner 
products i.e. the matrix X = (xij) is p.s.d. Most known algorithms which proceed by rounding a 
relaxation obtained from an SDP hierarchy [Chl06, CS06, BRSll] work with this mixed hierarchy 
■^.[RSOQ, KS09] and [GMT09] considered this hierarchy and obtained integrality gaps for Unique 
Games and approximation-resistant CSPs. 

max d, s.t. 

3{xs \ S C V,\S\ < r} s.t. xijj = 1 and 

yS,TCV s.t |5| + |r| < r : 

^ ^(-l)l^lx5uju« < kY,{-lf\xsuJ (1) 

iey JCT JCT 

^^^^ E E(-1)''"^5UJU{M}^^E(-1)'^'^5UJUW (2) 
ier{i) JCT JCT 

0< ^(-l)l''lx5uj<l (3) 

JCT 

Figure 2: Sherali-Adams LP relaxation (r levels) for Densest fc-subgraph: SA^ 

One level of the mixed hierarchy for Densest A;-subgraph gives the SDP relaxation introduced 
in [FS97, SW98]. [BCC+10] show that the mixed hierarchy performs better than log-density based 
arguments (which are captured by just the LP hierarchy) in a planted model. ^ It is interesting in 
this light to obtain integrality gaps for mixed hierarchy. 

2.2.3 The Lasserre hierarchy. 

The Lasserre hierarchy produces a sequence of SDP relaxations which are stronger than the Sherali- 
Adams and the mixed hierarchies. As in [CTll], the r-level Lasserre SDP for Densest A;-subgraph in- 
troduces a vector Us for each subset 5 C 1/ with |5| < r (Figure 3). The intended solution sets 
Us = Uijf if every vertex in S belongs to the densest A:-subgraph, and L^^ = otherwise. The vector 
lengths llt/s'll^ correspond to valid LP values xs for the Sherali-Adams relaxation presented above. 

is obtained. 

^[GSll] is an exception and seems to need a relajcation given by the Lasserre hierarchy. 

*In particular, the problem of detecting if dense fc-subgraph is planted in a random graph or not, in the parameter 
range D < n}^'^ . 
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max ^ ||C/{„^^}||^ 
such that 

{Us„Us,)>OioT all Si,S2 

{Us„Us,) = {Us,, Us,) when 5i U S2 = ^3 U ^4 

^{U{^}, Us) < k \\Usf for ah S 

Figure 3: Lasserre hierarchy (r levels) for Densest /c-subgraph 

Remark 2.1. As in Section 2.2.1, we can write an SDP which tries to find the k- subgraph of 
largest induced minimum degree d. This can he captured by the SDP constraint (analogous to [2)) 

V5, Vn eV, Yl i^M^^s) > d ■ Us), (4) 

v&{u) 

However, we show in Section 4-3 that our integrality gaps also hold for the Lasserre hierarchy 
defined by this SDP. We refer to the SDP with constraint (4) as the Min degree Lasserre SDP . 

3 Integrality Gap for the Sherali- Adams hierarchy 

In what follows L will denote the number of levels of the hierarchy we will consider. 

Theorem 3.1. Let L < 10 /og log n • integrality gap of SA^ is at least Q ( ^^J^i ) ■ 

To prove Theorem 3.1, we present instances G where the relaxation has a solution with value 
d = Vl{n}/^/L), while the integer optimum, i.e., the largest density of a /c-subgraph in G is only 
O(log'^n). It will be notationally convenient to construct gaps for L/2 levels. 

3.1 The instance. 

We in fact give a distribution over instances, and prove that the desired gap holds with high 
probability. The instances we consider are Q{n,p) random graphs with p = n~^/^logn (thus the 
expected degree of each vertex is D = n^/^logn). The parameter k is chosen to be n^/"^. An 
easy calculation shows that in any k subgraph, the density (and hence the min-degree) is at most 
0(log^ n) (see full version or [BCC^IO, FKPOl]). The meat of the argument is thus to show that 
there exists an LP solution to (Equations (l)-(3)) of value d = ^}{n^^^ / L) even for L of the 

order log n/ log log n. 

The following are the properties of the distribution Q{n,p) (with above parameters) we will 
truly be using [see Section A for proofs]. Any graph with these properties admits the solution to 
SA^/2 which we describe. 

1. Every vertex has degree between (n^/^logn)/2 and 2n^^'^\ogn. 
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2. Any two vertices i,j have at least one common neighbor and has at most 0(log^ n) common 
neighbours. 

3.2 Feasible solution. 

Before formally giving the xs values, we give intuition as to what they ought to be. First, we start 
out setting Xi = n~^/^ (equal for all vertices, since Xi < k = n^l'^ and no vertex is special). Next, 
suppose S <ZV with i ^ S and think of d n^/^. Now (2) implies that X]jer(i) ^^uj ^ n^l'^xs- 
Further from (1), we obtain Xljey ^suj ^ ri}-l'^xs- Thus we conclude that xs\jj must be roughly 
for j G r(5), while for j r(S'), it should be only n~^/^X5. Now consider T C V which 
span a tree: we could imagine starting with one vertex and adding vertices one by one (each added 
vertex is a neighbour of the previous ones), and thus conclude that xt is roughly (since 
Xi = n~^/^ to begin with). Now let S be an arbitrary set of vertices and consider a tree T ^ S: by 
monotonicity (a corollary of (3)), xs > xt, and since this is true for every such T, we need to set 
Xs to be at least n~^^^^^^~^^^/'^, where st(5) is the number of vertices (size) in the minimum Steiner 
tree of S. 

These, with additional 'dampening' factors (L-terms), are precisely the values we will set. More 
precisely we consider the solution 

X5 = n-i-(^*(^)+i).L-l^l, (5) 

where st(5'), as above, is the size of the minimum Steiner tree of S. Thus for instance Xi = n~^/^/L, 
while x^ijj = l/{v?/'^L?') when (i,j) G E and l/(nL^) otherwise (the latter is because there is a 
path of length-2 between any i,j€zG with high probability). 

Let us fix L < log n/ (10 log log n). We now show that the LP solution presented above is feasible 
for with high probability. The following lemma is useful in simplifying the analysis: it implies 

that we need to only consider T = while showing that the LP solution satisfies constraints (1) 
and (2). This is where the 'dampening' factors come into play. 

Lemma 3.2. Let S, T be disjoint subsets of V of size at most t and xs be the solution described 
above. Then 

JCT 

Proof. One property of the assignment (5) is that xsui ^ ^s/L for i ^ S. Further all the xs are 
> 0, and thus in the sum above, the term corresponding to J C T contibutes positively when \J\ 
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is even and negatively otherwise. Hence, 

)\J\ 



JCT 



- Yl (^5uj- Yl ^suju{i}) 

i=o JCT ier\5 
\j\=2e 

> E E xsuj{l-\T\/L) 



=0 JCT 

\j\=2e 



^ E 9 • E "^SUJ > f (since |r| < L/2) 



2 

^=0 JCT 
\J\=2£ 

A similar proof shows the upper bound, since the xsij{i} terms for i £ T dominate the contributions 
of for |J| > 1. □ 

Corollary 3.3. In checking feasibility, it suffices to check (1) and (2) with T = 0. 

Proof. Lemma 3.2 allows us to 'remove' the YljcT both sides of the equations (and set T = 0) 
by losing a factor of 2. Since we allow constant slack, the claim follows. □ 

We refer to the constraints (1) and (2) as the size and the density constraints respectively, 
because the former says that we should pick only a /c-subgraph, and the latter says the minimum 
degree (density) is at least d. The assignment we described allows us to prove the density constraint 
easily. 



Lemma 3.4. (Density Constraint) The xs described above satisfy constraints (2). 

^jer(i) 



Proof. Let S C V and i £ S. We need to check that X^,gr(i) ^suj ^ ' ^s- ^^^^ 



— 1/4 

that for every j G r(i), st(5 U j) < st(5) + 1, and thus xsuj ^ — ■ xs (the L term is due to the 
dependence on |5| in (5)). Since there are at least n^/^logn/2 terms in the LHS, the inequality 
follows. □ 

3.3 The Size Constraint and Minimum Steiner trees in Q{n,p). 
By the above corollary, it suffices to check (noting k = n^/^) that 

E ^sui < n^^^xs for all S cV,\S\<t. (6) 

We show this by proving that st{S L) i) > st{S) + 2 for most i £ V, in particular we bound the 
number of exceptions (lemmas below state the precise bounds). This then implies that (6) holds. 

We start with some basic facts (and notation) about Minimum Steiner trees (minST) of ^(c V) 
in Q{n,p), with our parameters. We will refer to the vertices in S as the terminals, and the rest 
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of the vertices in a minST as the non-terminals. First, the minST must have all its leaves to 
be terminals. Further, since every two vertices in G have a path of length two, we must have 
51(5") < 2151 — 1 for all S. This helps us bound the number of tree structures the minST of S can 
have. We define this formally. 

Given S gV, a tree structure for 5 is a tree T along with a mapping g : V{S) — t- V(T) which is 
one-one (not necessarily onto). The vertices in T without an inverse image in S are called internal 
vertices and the rest are also called fixed vertices. A tree structure for S is valid if it is possible 
to 'fill in' the internal vertices with distinct vertices from V such that all the edges in the tree are 
also present in G. [The relation to Steiner trees is apparent - the internal vertices are the Steincr 
vertices]. Given an internal vertex in T, the vertices of G which take that position in some valid 
'filling in' are called the set of candidates for that position. 

Before we get to the lemmas, we note that the number of tree structures for S of size < 2|5| is 
at most (2|S'|)^I'^I (this is just by a naive bound using the number of trees). Let us now bound the 
number of i e V for which st(S' U i) < st{S) + 1. 

Lemma 3.5. Let S G V and T be a tree structure for a min Steiner tree of S (so the leaves of 
T are elements of S). Then the number of candidates for each of the positions in T is at most 
(logn)2|^l. 

Proof. The proof is by induction on the size of S. The base case jS"! = 1 is trivial. Assume the 
result for all tree structures of sets of size < l^l — 1. Now consider S. We may assume that T has 
at least one non-terminal, as otherwise there is nothing to prove. 

First, note that there exists a vertex u G T which is adjacent to at most one non-leaf vertex in 
T. This is because deleting all the leaves in T gives a tree (which is not empty as there is at least 
one non-terminal in T), and a leaf in this tree our required u. If tt is a terminal, we could remove the 
leaves attached to u (thus obtaining a subset S' of the terminals) , and the remaining tree structure 
would be a valid min Steiner tree for S' . Further, the set of non-terminals is precisely the same, 
and thus the inductive hypothesis implies the claim for S. Thus suppose « is a non-terminal. 

If the degree of u (in T) is 2, then u has precisely one leaf attached to it (call it tj. Consider 
the tree T' obtained by removing «, and let S' = S\l. Now T' is a min Steiner tree for S' (if not, 
we could consider use this smaller tree for S' along with a path of length 2 to £ to obtain a smaller 
minimum steiner tree for S). If b is the vertex in T' attached to n, there are at most (log n)^'"^'^^ 
candidates for 6, by Induction Hypothesis. For each candidate 6, the number of candidate u is only 
O(log^n), and since £ is a terminal. Thus the number of candidates for u is at most (logn)^''^'. 
The rest of the non-terminals in T are also present in T', and this gives the result. 

If degree (m) > 2, then there are at least two leaves attached to u, thus the number of candidates 
for u is only log^ n. Consider one candidate x for u. Let T' be the tree obtained by removing all 
the leaves attached to u (thus u is now a leaf), and S' he S\J x minus the set of leaves attached 
to u. Now T' is a min Steiner tree structure for S' (otherwise we can obtain a smaller tree for S). 
Thus by the inductive hypothesis, the number of candidates for any internal vertex in T' is at most 
(logn)^!*^!"^. Since there are only log^n of the x's, it follows that the total #(candidates) for an 
internal vertex is at most (logn)^l'^l. 

This completes the proof, by induction. □ 

An easy corollary is the following. 
Corollary 3.6. Let S gV . There are at most {2\S\ logn)^'"^' vertices i such that st(5'LJi) = st(5'). 
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Proof. Each such i must be the internal vertex of some min Steiner tree for S, and there are at 
most (2| S'D^I'^I tree structures. Lemma 3.5 now implies the claim. □ 

Lemma 3.7. Let S C V . There are at most (4|S'| logn)^!"^! x n^/^ vertices i such that st(5 U = 
st{S) + l. 

Proof. Let i be such a vertex. First, note that if there exists a min Steiner tree for S VJ i with i 
as a leaf, we are done. This is because removing i gives a min Steiner tree for S, and thus i is a 
neighbour of an internal vertex in a min Steiner tree for S. Thus by Corollary 3.6 there are only 
{2\S\\ognf\^\ X (2n^/2logn) such i. 

Thus suppose that the min Steiner tree for S* U i has i as an internal vertex. We will prove the 
bound as follows: we consider a tree structure T of size st(5) + l with leaves being terminals from S\ 
then we show that the number of candidates for any fixed position in T is at most (2|S'| log n)'^^^^n^/'^ . 
This suffices, because the number of choices of tree structures adds an additional factor of (2|S'|)^I'^L 




Figure 4: An example 

Let us consider a structure T as above, and a position u. Since u is not a leaf, it has degree 
at least 2. Let the degree be d, and let Ti, . . . , be the subtrees of T formed by removing u (see 
figure ...). Now if for some z, Tj is the min Steiner tree for the terminals in T,, we are done, because 
then, each candidate for u must be neighbour of an internal vertex in the tree, and by Corollary 3.6 
there are only y/n x (2|S'| logn)^'"^' candidates. Thus for each i, Ti must have a strictly smaller tree 
T-. Let the vertex in Ti connected to u be called bi. Now construct a new tree as follows: leave 
Ti intact, and replace T2, . . . , by Tg, . . . , T^; connect bi to Tg, . . . , using paths of length 2. 
The number of edges in the new tree is now at most \T\ — d — {d — 1) + 2{d — 1). The first term is 
the original cost, followed by removal of u, followed by the decrease by using T- as opposed to Ti, 
followed by the cost of adding length-2 paths. 

Thus the new tree has cost at most |T| — 1, and thus it is optimal for S\ Further, u is adjacent 
to 61 which is an internal vertex, and thus the number of candidates is bounded by the desired 
quantity. □ 

Putting things together. Consider the sum ^^^y xsui- Corollary 3.6 implies that there are at 
most (Llogn)^ terms which contribute a value xs/L. Lemma 3.7 implies that there are at most 



9 



n^/^ • (2Llogn)^^ terms which contribute a value xs/{n^^'^L). Thus if we pick (2Llogn)^^ < n^/^, 
we have the bound that the sum is at most n^^'^xs, as desired. 

Thus we have verified each of the constraints (l)-(3). This completes the proof of Theorem 3.1. 



3.4 Gaps for the mixed hierarchy (SA+). 

Consider the relaxation SAt described in (l)-(2), along with the constraint: Z = ixij)i<ij<n 
The solution considered earlier (Equation (5)) turns out to also satisfy this PSD condition with 
high probability. The entries of Z are 



Thus we have 



Zi. 



z 



n 



^1/2 



/L if i = J 



n^"^ / L? otherwise 



1 
L 



riL ~'~ n^/^ ~'~ 



1 



3/4L 



A 



n 



where A is the adjacency matrix of G. Now ^ is a Q(n,p) matrix with p = n~^/^ logn. Thus the 
least eigenvalue is at least — 2y^np(l — p) with high probability (by the Semicircle law). This is at 
least — 4n"'^/^(logn)"'^/^. Thus we have A + 4n^/^-v/log nl >z 0. Using the fact that J ^ 0, we obtain 
that Z ^ 0. 

This shows that adding an SDP constraint at the first level does not give us any additional 
power - the relaxation obtained after ^{\^^^) levels also has an integrality gap of Vl{ti}^^). 



Question 3.8. Does Theorem 3.1 hold for L S> logn? 

We conjecture that even L ^ levels does not reduce the integrality gap substantially. We 
need a different approach (involving a better argument for bounding the number of trees) to extend 
the arguments above to this range of L. 



4 Integrality Gap for the Lasserre hierarchy 

In this section, we show a gap instance with arbitrary large constant ratio for linear-round Lasserre 
relaxation, and a gap instance with ratio for n^~'^^^-'-round Lasserre relaxation (Theorem 4.7). 
We also aim at maximizing the ratio of a polynomial-round Lasserre gap instance, getting a ratio 
ofQ(n2/53-£) (Theorem 4.8). 

Our construction is based on a variant of Tulsiani's gap instance for Max K-CSP [Tul09] - we 
extend the parameter range of Tulsiani's instance. Then we convert the Max K-CSP instance to a 
constraint-variable graph and duplicate the variable vertices, which is our gap instance for Densest 
A:-subgraph. Note that the gap for Max i^-CSP problem is indeed a set of random instances. 
The vector solution from Lasserre gap for Max K-CSP will help us exhibit a good Lasserre vector 
solution for Densest A;-subgraph. We finally use the structure of random instances of Max i^-CSP to 
show the soundness holds with high probability. 

Now, let us proceed to the first step, the gap instance for Max i^-CSP. 
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4.1 Lasserre Gap for Max K-CSP from [Tul09]. 

We start by defining the Max K-CSP problem. 

Definition 4.1. Let C Ff he a q-ary linear code of block length K. 

1. An instance $ of Max K-CSP{C) is a set of constraints Ci, C2, ■ ■ ■ , Cm where each constraint 
Ci is over a K -tuple Ti = (xj^ , , • • • , ) , and is of the form (xj^ + b^(^ , xi^ + 63 ^ , • " " ) ^ik + 

G C7 for some 6« G Ff . 

2. A random instance o/Max K-CSP{C) is sampled by choosing each constraint Ci independently, 
where we sample K variables without replacement from [n] to get Ti = (xj^jXjj, • • • o.'^^ 
6*-*^ is chosen from F^ uniformly. 

The following theorem is an extension of the main theorem in [Tul09] , showing that polynomial- 
round Lasserre relaxation cannot refute random Max i^-CSP with high probability. 

Theorem 4.2. If C is the dual code of a distance 25 > 3 code (in terms of number of coordinates, 
not fractional distance), for every 10 < K < n^/^, if n'^-^ < V < 1/(10^ • (/3if25+o.75)i/{5-i)-) 
for some k > 0, then for large enough n, a random instance <^ of Max K-CSP{C) over m = /3n 
constraints and n variables, with probability 1— o(l), admits a perfect solution for the SDP relaxation 
obtained by rjn/16 rounds of the Lasserre hierarchy, i.e. there are vectors V(s,a) for 0,11 S C [n] with 
\S\ < rjn/lQ and all a : S ^ ¥q, such that 

• the value of the solution is perfect: J^iLi J2a-Ti^¥q Ci{ct) \\V'{Ti,a)\\'^ = fn; 

• {V{S^,a^),y {82,0,2)) >0 for all Si,S2,ai,a2; 

• {y{Si,ai),y{S2,a2)) = ifai{SinS2) / a2(5'i H 52)/ 

• {V{Si,ai), y{S2,a2)) = C^(53,a3)' ^^(54,04)) /^^ U ^2 = U S4 and Ui o 02 = o 04; 

• 11^(0,0)11^ = 1 SjeF, = 1 for all i £ [n]. 

Note that Theorem 4.2 extends the original theorem of [Tul09] to the regime where K might 
be superconstant (even poly(n)). The proof of Theorem 4.2 follows the proof in Tulsiani's paper, 
with the following changes. 

Observation 4.3. By the first property of the solution given in Theorem 4-2, we know that for 
every ie [m], we have Eq:T,->f, Ci{a) WVr^af = I , and therefore Ea:T,^F, Ci{a)V^T„a) = V(0,0). 

Recall that Tulsiani showed that, if the constraint-variable graph of a Max K-CSP{C) instance 
has very high left-expansion, then the Lasserre SDP admits a perfect solution for it. Formally, the 
following lemma is (implicitly) shown in [Tul09]. 

Lemma 4.4 ([Tul09]). Given a Max K-CSP{C) instance, if every set of constraints of cardinality 
s <r involves more than {K — S)s variables (where 26 is the distance of the dual code of C), and 
if 4:5 < K, then there is a perfect solution for the SDP relaxation obtained by r/16 rounds of the 
Lasserre hierarchy. 
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Hence, we only need to prove the following lemma which shows that the constraint-variable 
graph still has very high left-expansion, even when a constraint might involve superconstant many 
variables (i.e. the left degree might be superconstant). 

Lemma 4.5. Given f3,r],K as in Theorem with probability 1 — o(l), for all 2 < s < r]n, every 
set of s constraints involves more than {K — 5)s variables. 

A similar lemma can be found in [Tul09] (Lemma A.l), which only deals with constant K. We 
need a more refined argument for superconstant which is in Section 4.4. 

4.2 The Lasserre gap for Densest fc-subgraph. 

The gap instance is reduced from the gap instance for Max i^-CSP in Theorem 4.2. Let C be the 
dual code of a [K, K — t, 25]q code as used in Theorem 4.2, where K is the block length, {K — t) 
is the dimension, and 25 > 3 is the distance of the code. Such a code has size \C\ = q^, and is 
very sparse for small enough t. For 1000 < q and K > q'^ , we let /3 = (40q'*"'"^ In q)/K, and do the 
following reduction. 

Given a Max K-CSP{C) instance $ with m = /5n constraints and n variables. Let G<j> = 
[L^^R^,E^) be the bipartite graph with m\C\ left vertices and nq right vertices. For every con- 
straint Ci and every partial assignment to variables in the corresponding tuple Tj which satisfies the 
constraint Cj, we introduce a left vertex. For every variable Xi and its corresponding assignment, 
we introduce a right vertex. Formally, 



We connect a left vertex (Ci,a) and right vertex {xj,a') when Xj G Tj and a' is consistent with a, 
i.e. 



Now we define the final graph G'^ = {L^, R'^,E'^) in which we want to find a dense fc-subgraph 
where k = 2m. We take (3 copies of the right vertices in to get R'^. To get E'^, we connect a 
left vertex u £ and a right vertex v G R'^ if u is connected to w's corresponding vertex in R^ in 
The graph G'<^ has N = m\C\ + /Snq = 0{nq^^^'^hiq/ K) vertices. 

In our analysis of the reduction, we need a g-ary linear code C that has a small constant distance 
(but no less than 3), small block length (but more than q), and very high dimension. Thus, we 
instantiate the code C with Generalized BCH codes given by the following. 

Lemma 4.6 (Generalized BCH Codes). For every prime tower q, and integer 28 > 3, there are 
q-ary linear codes of block length K = q^ — 1, dimension {K — A5 + 3), and distance at least 26. 

We include a simple proof of Lemma 4.6 as follows. 

Proof. Let 7 be a primitive element of Fg2. Let D = 25 for notational ease. We construct the 
following code 




{{Ci, a)\i e[m],a:Ti^ ¥g, Gi{a) = 1} 
{{xj,a)\j G [n],a : {xj} ^¥g}. 



{{{Ci,a), {xj,a)}\{Gi,a) G L$,Xj G Ti,a'{xj) = a{xj)}. 



C = {(C1,C2, • • • 



, v_i) G Ff -i|c(l) = c(7) = c(7') = • • • = c(7^-') = 0, 
where c{X) = ciX + C2X^ + cgX^ + • • • + Cg2_iX'?'-^}. 
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We first show that the distance of C is at least D. Since C is a linear code, we only need to 
show that every non-zero codeword has weight at least D. 

We show the contrapositive statement : the only codeword of weight at most D—1 is 0. For every 
codeword of weight at most D — 1, suppose the non-zero entries are in the set {cj^ , Cjj, Cjg, • • • , CiQ_^}, 
we have 





+ 


^12 


+ 


Ci3 


+ 


•• + 




=0 




+ 




+ 




+ 


•• + 




=0 




+ 




+ 




+ 


•• + 




=0 



T^^-'^^^Q, + j^^-'^'^c., + 7(^-')^^Q3 + ••• + 7(^-')^--^Q,_, =0 

Note that the coefficients form a Vandermonde matrix (which has full rank). Therefore we have 
Cii = Q2 — '^ia — ' " — ^-iD-i ~ ^-^^ codeword is 0. 

Now we show that the dimension of C is at least {K — 2D + 3). Note that each constraint 
^(7*) = 0(1 ^ i ^ D — 2) can be implemented by 2 linear constraints in (since 7* E ^q^), 
while the constraint c(l) = is indeed a linear constraint in F^. Therefore, we need at most 
2{D — 2) + 1 = 2D — 3 linear constraints for C, i.e. the dimension of C is at least {K — 2D + 3). 

Finally, if the dimension of C is more than {K — 2D + 3), we can take a linear subspace of C 
of dimension (K — 2D + 3), while the distance of the subspace code is no less than the distance of 
C. □ 

4.3 Analysis. 

We get a family of gap instances G'^ parameterized by g > 1000 and 26 > 3 (using Lemma 4.6). 
We obtain our two main results of this section by picking appropriate parameters for code C as 
follows. To get lasserre integrality gaps for Ari-O(^) levels , we show the following by setting the 
distance 26 = 3. 

Theorem 4.7. For every 1000 < q < N"" (where e is an absolute small constant), there is a gap 
instance of ratio ^{q) for N/q'^^^^ -level Lasserre SDP. The same construction also works for the 
Min degree Lasserre SDP, when q = r2(logn) and q < N''. 

We now aim at getting a gap instance of ratio for polynomial-round Lasserre SDP, where e is 
maximized. By setting q = n"' for some small constant 7 > 0, the distance 26 = 4, and optimizing 
the other parameters, we obtain the following (refer to section 4.3 for details) 

Theorem 4.8. For small enough k > 0, there is a gap instance of ratio A^2/53-0(k) ^ys, 
round Min degree Lasserre SDP. 

The two theorems follow because of Theorem 4.2, Lemma 4.9, Lemma 4.11 (completeness) and 
Lemma 4.12 (soundness). In the completeness case, we will use our r-level Lasserre solution for 
Max K-CSP to show that the Lasserre SDP after R = r/K levels of the hierarchy has value at 
least j3mK. In the soundness case, we show that with probability 1 — o(l), the graph G'^ does not 
have any 2m-subgraph of value more than 17 /q times the SDP value (Lemma 4.12). Therefore, the 
graph G'q, is a gap instance of ratio ^[q) for i?-round Lasserre SDP. We proceed by first proving 
these lemmas. 
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Completeness. 

Lemma 4.9. // the Max K-CSP{C) instance $ admits a perfect solution for r-round Lasserre SDP 
relaxation, then the r/K-round Lasserre SDP relaxation for the Densest A;-subgraph instance G'^ 
has a solution of value fimK . 

Proof. For any set S C L$ U i?'^, suppose the left vertices included in S are 

(Cn,ai), (Q2>"2), • • • , (Ci^^,ari), 
and the right vertices included in S are 

where ri + r2 <r/K. Let 

5' = U Ti, U • • • U Ti^^ U {Xj,} U {x,J U • • • U 

We have |-S"| < Kr\ + r2 < r. If all the partial assignments aj's and a^'s are consistent to each 
other (i.e. there are not two of them assigning the same variable to different values), we can define 

a = CKi O a2 O • • • ttri o Q;^ o o • • • ) 

and let Us = Vi^s',a)j we let Us = 0. 

Observation 4.10. For every 5 C L$ U R'^, we have {V(0^0),Us) = \\Us\f ■ 

Proof KUs = V^s',a) for some S',a, we have {V^^fi),Us) = {V(^^,$),V(^s',a)) = ||V(5',a)f = Wsf- 
If Us = 0, we have ( V(0,0) , Us) = \\Usf = Q. □ 

We can check that all the Lasserre constraints are satisfied. 

• For two sets 81,82, either at least one of the vectors Us^,Us2 is (therefore their inner- 
product is 0), or Us^ = Vs'^^ai,Us2 = Vs^,a2 for some 8[,S'2,ai,a2 and {Us^,Us.,) = 

{Vs'^,a„Vs:^,a,)>0. 

• For any Si, S2, S3, 84 such that S*! U ^2 = S^U S4, either the set of partial assignments in 

S*! U 5*2 = 5*3 U 84 arc consistent to each other, in which case we have UsiUS2 = ^3^084 = Vs,a 
where 8 is the union of all the variables included in 81 U 5*2 and a is the concatenation of the 
partial assignments in U 82; or we have UsiuS2 = Us^uSa = 0- 
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• For every set S, we have 

(Ci,a)eL4> (a:j,a)e-R^ 
m n 

m n 
«=1 i=l a:{a'j}-s>F, 

=(m + /3n) \\Usf = 2m ||C/sf , 

where the third last equahty is because of Observation 4.3, and the second last equality is 
because of Observation 4.10. 

2 II ||2 

• Finally, we have \\Ufij\\ = = 1- 
Now, we calculate the value of the solution 

E II^mII^ 

m 2 

=/^E E E \\^m,a),{xj,a^^^^y)} 

i=l a:Ti^¥q,Ciia)=lXj£T, 
m 

=/3E E ^ii%.«)ir 

i=l a:Ti-^¥q,Ciia)=l 
m 



=/3 ^ K = /Sm/C. 



□ 



If we add the constraint (4), we can still get a good SDP solution for the Min degree Lasserre 
SDP with high probability, as long as q is superconstant. 

Lemma 4.11. For q = r2(logn), with probability 1 — o{l), this vector solution also satisfies the 
added constraint (4) with d = I3K/2, i.e., for every set S, for each vertex u, we have 

ysyu ev, Yl ^^{«.-}' ^s) > PK/2 ■ {U{^}, Us). 
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Proof. For each left vertex (Ci,a), we have 

=P (t^{(C.,a),(x„a|{,.j)},t^S> 

For each right vertex (xj, a'), we have 

v&r{{xj,a')) 

v.TiBXj a:Ti^¥q,Ci{a)=l,a{xj)=a'{xj) 

= Y Y {U{(C\,a)},Us) 

i:Ti9Xj a:Ti^¥q,Ci{a)=l,a{xj)=a'{xj) 

= Y Y {U{iC,,a)},Us), 

v.TiBXj a:Ti^¥q,a(xj)=a' (xj) 

where the last equality is because we know that f/{(Ci,a)} = when Ci{a) / 1. By the property of 
Lasserre vectors, we know that for each i G [m] , 

Y ^m,o.)} = ^{(^.,«')}' 

a:Ti^¥q,a{xj)=a'{xj) 

therefore 

Y i^{{xj,a'),v},Us) = Y i^{(x,,a')},Us). 
v€T{{xj,a')) v.Ti^Xj 

For q = il(logn), the expected number of constraints containing Xj is (3K = il((logn)*^^) = 
r2(logn), by our choice of /3. Therefore, by Chernoff bound and union bound, with probability 
1 — o(l), for all Xj, there are at least liK/2 constraints containing Xj, and for every S, xj and a' , 
we have 

Y Us) > W2 • {U{^x^^a')},Us). 

ver{{xj,a')) 

□ 

Soundness. 

Now, we show that random instances of Max K-CSP give rise to graphs G'^ whose 2m-sized 
subgraphs have density 0{f3K/q). Note that the large alphabet size q allows us to get a much 
larger gap than we would starting from random AND instances [Fei02]. This allows us some slack 
in the size of the subgraphs we need to argue about. 

For C the dual of a [K, K — t, 26]q code, we prove the following soundness lemma. 



16 



Lemma 4.12. When /3 > (40g*"'"^ lng)/i^, for a random Max K-CSP{C) instance with probabil- 
ity 1 — 0(1), any subgraph of G'^ obtained by choosing 2m left vertices and 2m right vertices contains 
at most n(5mK/q edges, and therefore any 2m-subgraph of G'^ contains at most 17 f3mK/q edges. 

Note that G'^ was constructed by taking /3 copies of the right bipartition and rephcating the 
edges. To prove Lemma 4.12, we only need to prove the following lemma. 

Lemma 4.13. Suppose that q > 1000, K > q'^/2,t < 10. When j3 > {AOq^'^'^ In q) / K , for a random 
Max K-CSP{G) instance with probability 1 — o{l), any subgraph of G^ obtained by choosing 2m 
left vertices and 2n right vertices contains at most llmK/q edges. 

Proof of Lemma ^.12 from Lemma 4-13. We only need to prove once there is a 2m x 2m subgraph 
of G'^ with t edges, there is a 2m x 2n subgraph of G$ with at least i//3 edges. Fix 2m left 
vertices in G'^, to maximize the number of edges in the subgraph, we need to select the 2m right 
vertices with most edges connected to the chosen 2m left vertices. Since any two right vertices G'^ 
corresponding to the same right vertex in G$ have the same set of neighbors, there is an densest 
2m X 2m subgraph H' of G'^ that, for any two such vertices, chooses either both or neither of them. 
Now we define an subgraph H of G$ that contains the same 2m left vertices. It contains a right 
vertex if any copy of the vertex is contained in H' . H contains 2m/ (3 = 2n vertices, and it is easy 
to see that there are (at least) t/P edges in H. □ 

We proceed by fixing a set of 2n vertices R on the right. Lemma 4.13 follows from the following 
claim by a standard union bound over all possible choices of R. 

Claim 4.14. Recall that Gq, = {Lq,,Rq,,Eii>). Suppose that q > 1000, if > q^/2,t < 10. Fix a subset 
R C i?$ (note that R^ is the same for all the instances ^ of n variables), \R\ = 2n, the probability 
( over choice of ^) that there does not exist a subset L C L$ of size 2m such that the number of 
edges in the induced subgraph by LU R is more than llmK/q, is at least 1 — ex.p{—mK / {lOq^'^'^)). 

Proof of Lemma 4-13 from Claim 4-14- Since there are only (2^) < exp(2n(lng + 1)) choices of R, 
by a union bound, with probability at least 

1 - exp(2n(lng + 1)) • exp(-mir/(10g*+2)) 

= 1 - exp(2n(lng + 1) - /3nif/(10g*+^)), 

there is no 2m x 2n subgraph of G^ containing more than 17mK/q edges. The probability becomes 
1 - 0(1) when P = (40g*+2 /x. □ 

Proof. (Proof of Claim 4.14) First, we show that with high probability, a constraint Cj is "poorly 
satisfied". That is, none of the left vertices corresponding to a constraint Cj has more than ^l(K/q) 
neighbors in ii - this number is roughly 1/q times the corresponding value in completeness case. 
We prove this in the following two steps. 

Step 1. Fix a subset R C \R\ = 2n, for each variable Xj, let deg(xj) be the number of vertices 
in R that corresponding to xj, i.e. let deg(xj) = \Rri {{xj,a)\a : {xj} — )• ¥g}\. For a subset of 
variables T C {xi,X2, • • • ,Xn}, let deg(r) = X^j, deg(xj). We call T good if the average degree 
of variables in T is not more than 4, i.e. deg(T) < 4|r|. 
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For a random T with |T| = K, note that the expected degree E[deg(T)] = 2K. Therefore, by 
Hoeffding's inequalities for samphng without replacement (Theorem 1 and Theorem 4 in [Hoc62]), 
we have 

Pr[r is not good] = Pr[deg(T) > AK] < exp(-ln(4/e) • 2K/q) < ex.p{-K/{2q)). 

Step 2. Again, fix i? C R^, T C {xi, X2, ■ ■ ■ ,Xn}, for a codeword a on coordinates in T, i.e. 
a : T — )• Fg, let agrj.(a, R) = \{{xj, a\xj)\xj & T} Ci R\. For a constraint Cj, say it is poorly satisfied 
if for all a : T ^ ¥q such that Ci{a) = 1, we have agr'p.(a, i?) < 8K/q. 

Recall that to sample a random constraint Ci, we first sample a random i^T-tuple Tj, and a 
random shifting function b^^\ Note that for a fixed q : T — )■ Fg, and a fixed Tj that is good, when we 
take a random shifting function b^'^^ : Ti — )• Fg, we have E^(i)./p^_j^p^[agrjv(a — i?)] = deg(Ti)/g = 
4:K/q, therefore, by standard Chernoff bound, for a fixed codeword a G C, the probability that a 
makes Ci not poorly satisfied is bounded from above by 

Pr[agr7,.(a - b^'\R) > 8K/q] < exp(-ln(4/e) • iK/q) < exp{-K/q). 

Since there are |C| = g* < q^^ codewords, by a union bound, for K > g^/2 and q > 1000, we have 

Pr[Cj is not poorly satisfiedlTj is good] < q^^ ■ exp(— K/g) < exp{—K/{2q)). 
In all, we have 

Pr[Cj is poorly satisfied] 
> Pr[Cj is poorly satisfied |Ti is good] • Pr[r is good] 
>(1 _ exp{-K/q)){l - eM-K/{2q))) 
>l-eM-K/m). 

Now, again, by standard Chernoff bound, we have 

Pr[|{Cj|Cj is not poorly satisfied}! > m/{q ■ \C\)] 
<(e-|C7|-g-exp(-/^/(3g))r/(''-l^l) 
<(e-g^0-g-exp(-K/(3(?)))™/^^-l^l^ 
<exp(-i^/(10g))'"/(''-l^l) 
= exp(-mi^/(10g*+2)). 

By the calculation above we know that with probability at least 1 — e'x.p{—mK / (lOg*"^^)), there 
are at most m/{q ■ \C\) constraints that are not poorly satisfied. 

For each left vertex {Ci, a) £ L^, if Ci is poorly satisfied, we know there are at most 8K/q edges 
from (Cj, a) to R. If Ci is not poorly satisfied, there are at most K edges to R^ - this upperbound 
also applies to R. 

Therefore, with probability at least 1 — exp(— mi^'/(10g*"^^)), any set of 2m left vertices has at 
most 2m ■ 8K/q + m/{q ■ \C\) ■ \C\ ■ K < llmK/q edges connected to R. □ 

We now complete the proofs of the main theorems in this section. 
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Proof of Theorem 4.7. 



By combining Theorem 4.2, Lemma 4.9, Lemma 4.11 (completeness), and Lemma 4.12 (soundness) 
we see that with probabiHty 1 — o(l), the graph G'^ provides a il(g) integrahty for the number of 
levels R given by 



n 



K(/3i^25+0.75)l/(<5-l) 

N 

q2t+2 . (^i^25+0.75)l/(5-l) 

N 



^ J^(2<5-0.25)/(<5-l)g(2t+2)+(t+2)/(5-l)poly Jog 

(recall that /3 = (40^*+^ lng)/i^) 
Recall that K = — 1. By setting K = — 1 and 26 = 3, we verify that the theorem holds. □ 

Proof of Theorem 4.8. 

Let q = n*, since N = 0{nq'^*~^'^ In q/ K) = 0{nq'^* Inq), ratio of the gap due to Lemma 4.9 and 
Lemma 4.12 is 

n{q) = f)(Ar7/(i+2i7+o(i))) = f](jvT/(i+(8'5-6)7+o{i))). (^ote that t = 4,5 - 3) 

This means that 

_ 7 _J: l + o(l) 

^ ~ 1 + (8(5 - 6)7 + 0(1) ~ 8<5 - 6 (1 + (85 - 6)7 + o(l))(8(5 - 6) ' 

Note that when 2(5 > 3 is fixed, e is maximized when 7 is maximized. 
The number of rounds (due to Theorem 4.2 and Lemma 4.9) is 

R = n ^ 
= n 

= Q 



_^(25-0.25)/{5-l)g(2t+2)+(t+2)/(5-l)polylog 

N 

^27(25-0. 25)/(<5-l) . ^7((8<5-4)+(45-l)/{5-l))+o(l) 

(by K = e{q^), q = n"',t = A6 -3) 
N 



^7{8<5+4+6.5/(<5-l))+o(l) 

. f(8S+4+6.5/{S-l))+o{l) 
Q { N l + o(l) + 7(8«-(i) 



For very small k > 0, to get a gap instance for A^^^'^^-round Lasserre, we need 

1 + (5(1) + 7(8(5 - 6) - ^ ' 
^1 + 0(1) + 7(85 - 6) - (7(8(5 + 4 + 6.5/{6 - 1)) + o{l)) > n{K) 
^1 - 7(10 - 6.5(5 - 1)) > n{K) 

1 - n(K) 

=^7 < — — 

' - 10-6.5((5-l) 
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Let 7 = io+6.?/(?-i) ^ we have 



l + o(l) 



8(5-6 (l + (8(5-6)7 + o(l))(8(5-6) 
1 l + o(l) 



85-6 (1 + (85 _ 6) ,o^^-gg_,) + o(l))(8<5 - 6) 
1 1 

0(k). 



^ 10+6.5/(5-1) >y°" ^) 

When 25 = 4, we get the maximized value e = 2/53 — 0(k). □ 

4.4 Expansion for random Max K-CSP instances. 

In this section, we prove Lemma 4.5, restated as follows. 

Lemma 4.5 (restated). Given /3,rj,K as in Theorem 4^.2, with probability 1 — o(l), for all 
2 < s < rjn, every set of s constraints involves more than [K — 5)s variables. 

Proof. Fix 2 < s < r/n, let us upperbound the probability that there is a set of s constraints 
containing at most {K — 5)s variables. Since there are C^") such sets, the probability is at most 

I3n\ 

\ Pr[the first s constraintscontain at most {K — 6)s variables] 

J Pr[the first s constraints contain exactly i variables]. 

Fix a set T of i variables, let p{s,i) be the number of s-tuples (Ti,T2,--- ,Ts) where for each 
1 ^ J ^ S) Tj is a set of K variables, such that Ui<j<sTj = T. We have 

Pr[the first s constraints contain exactly i variables] ~ ^ - ^ ' ^ (^j<^ 

To upperbound p{i,s), we view the way to enumerating valid {Ti,T2,--- ,Ts) as, to choose a 
multiset of Ks variables (each one from T) so that each element in T appears at least once in the 
multiset, then view each element in the multiset as a distinct element, and distribute these Ks 
elements to s sets, in a balanced way. Note that in this way, we are able to enumerate all the valid 
s-tuples (although some of them might be enumerated more than once). Since there are at most 
(^!~L^) < (^^) valid multisets, we have 

p{i,s)<(^'yKs)l/iKlY. 



Therefore, we have 



Pr[the first s constraintscontain at most (K — 6)s variables] 



^;),..„.,.-o-(;)-"g (';)(- 
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Note that when K'^s < Sn and i < {K ~ 6)s, we have i < nKs/{n + Ks) (since i < Ks{l — S/K) < 
Ks/{1 + 6/K) = nKs/{n + 6n/K) < nKs/{n + Ks)), and therefore 



^ — ^ /X (n - i)(iirs - i) > r ni^Ts > (n + 



therefore the function ( is increasing when i < (K — 6)s, therefore 



^ ( n \fKs 



forK < nV2, we use the fact that (|^) > {n-K)^ /K\ > /3/{{K/e)^-{5^/K)) = (en/K)^ /{15^/K) 
(since by Stirhng's formula, we have K\ < 5VK{K/ e)^), and again use the fact that \/2ttK{K/ e)^ < 
K\ < 5y/K{K/e)^ , we bound the expression above by 



[ s ) iV2^{K/e)^)s ■ ^''^'^ [en) ) [iK-S)s) [ 5 ) 



Ss 



<5{Ks)^-^ 
<5iKs)^-^ ■ { 



2^nS-^K - S)K-^S^ 



For 2 < s < In^n, since n'^-^ < 1/(10« • (pK^S+o.75^l/{S-l)^^^ ^^^^ ^2^45+1.5/^2(5-1) < 
^-(25-i)k^ we have 



27rn'5-i(5'5 



<5(iCs 



,1.5 



-^/2^(55n2(5-l)'^ - ^ ^ 



For In^n < s < r/n, since r? < 1/(10^ • (/3itC25+o.75')i/(5-i)-)^ ^ < 1/(10« • (;9K2'5)V('5-i)), and 
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further we have /3i^25^5-i < ^^/(loo • ISe^+Vv^) for all 5 > 5/4. Therefore, 



<5 • 



V100(r/n)'5- 



<5{Ks 



1 

.2 



100(7/n) 



(5-1 



<5 • ^^j^J • (by s < rjn and iiTs < n^) 

Now, we upperbound probability that there exists a set of constraints of size s < r]n involving 
at most (K — 6)s variables by 

s=2 ^ 

In^n 1+5^ 5-1 7^2,5+0.5 \« J?" / 1 c „l+5/0 5-1 7>'25+0.5 \ ^ 

Y^ms^- r" fli/ — )+ E ^t./ — ) 

In^ n rjn 

<Y^Oin-^/')+ 5.(l/50r = 0(1). 

«=2 s=ln^ n+1 

□ 



5 Conclusion 

In this paper, we show integrality gap lower bounds of J7(n^/'*/ log^ n) for r2(log n/ log log n) levels of 
the Sherali-Adams+ SDP relaxation, and Cl(n'^/^^~^) for n^^*^^ levels of the Lasserre SDP relaxation 
for the Densest A;-subgraph problem. 

The gap instances for SA+ SDP are actually (Erdos-Renyi) random graph instances Q{n,p). We 
believe these instances should give ri(n^/^~'^) gaps for even stronger relaxations - in particular higher 
levels of the Sherali-Adams hierarchy, with stronger SDP constraints. The sub-exponential time 
algorithms for Densest /c-subgraph in [BCC^IO] imply that the integrality gap becomes 0{n^^'^~^) 
after n'^^^-' levels of an LP hierarchy which is weaker than the Sherali-Adams hierarchy. In fact, 
these sub-exponential time algorithms were inspired by attempts to construct integrality gap lower 
bounds for many levels (polynomial in n). It would be interesting to close this gap by obtaining 
matching integrality gap lower bounds for n^^^^ levels. As a further goal, one might also hope to 
combine the techniques used in both parts of this paper, to get 0(n^/^~'^) gaps for polynomial levels 
of the Lasserre hierarchy. 
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A Random graph properties 

We prove that the properties used in our gap construction hold for Q{n,p), withp = n^^/^(log n)^/^. 
These properties are listed in Section 3.1. In what follows fix p to be the value above. As mentioned 
in Section 2.1, the phrase "with high probability" (w.h.p.) refers to 'with probability at least 
1 — where q{n) is an arbitrary polynomial in n (sometimes there will be a constant depending 
on the polynomial). 

Lemma A.l. Every vertex of G has degree between {n^^'^logn)/2 and 2n^/^logn w.h.p. 
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Proof. Let u & V. The degree d{u) (as a random variable) is the sum of n i.i.d. Bernoulli 
random variables each having parameter p = n~^/^logn. The expected value is thus n^/^logn. 
This is 3> logn, and thus by Chernoff bounds, the probability that Pr[|d(M) — n^/^logn| > t] < 
g-t /4np(i-p) ^ for any polynomial q{n). Taking union bound gives the claim. □ 

Lemma A. 2. Every pair of vertices in G have at most 21og^ n common neighbours w.h.p. 

Proof. Let u,v £ V. Let Xi be a random variable which is an indicator for i G T{u) H T(v). In 
G{n,p), we have EXj = = log^ n/n. Thus E \T{u) n r(i;)| = np^ = log^ n. Thus the probability 
that it is > 21og^n is at most e"'°s Taking union bound over all u, v, we obtain that this is 
smaller than any polynomial. □ 

Lemma A. 3. Every pair of vertices have at least one common neighbour w.h.p. 

Proof. As above, consider some u, v; we have E \ T{u) n r(t;)| = np = log^n. Thus Pr[|r„nr(u)| < 
logn] < e-^°s "/^ (since we can use Chernoff bounds as long as the expectation 3> logn). Taking 
union bound again implies the result. □ 

Lemma A.4. No induced subgraph on rv^l'^ vertices has density > 5 logn w.h.p. 

Proof Let 5 C y of size n^/^ ^j^^^ ^E{S,S) = ("3^) ■p = n^/^\ogn/2. Further the variance of 
this quantity is (^^ )p{l — p) < n^/^ logn. Thus by Chernoff bound, 

Y'y[\E{S,S)-^E{S,S)\ >t\< e-*V4ni/2logn_ 

Picking t = 4n^/^logn, the probability upper bound is e^^** ' ^"g". Thus we can take a union 
bound over ah the (^I/a) subsets S. This proves the claim. □ 
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